Interactive Clustering with a High-Performance ML Toolkit
نویسندگان
چکیده
Clustering is a class of machine learning algorithms which has important applications in many different fields. Users often use clustering to find hidden structures from data for those domain specific problems. However, evaluating clustering results is always a hard problem. In many and perhaps most of these applications, users need to trade off competing goals and encode prior knowledge into the model to define what is the best result. The learning algorithm however has evolved around the optimization of a single, usually narrowly-defined criterion, which may not obtain satisfactory results. In most cases, an expert makes trade-offs between different criteria which requires high-level (human) intelligence. This motivates us to provide interactive customization and optimization so that the expert can incorporate secondary criteria into the model-generation process in an interactive way. In this demo paper we will demonstrate the techniques we developed to do customized and interactive model optimization for clustering algorithms. The keys to the approach are (i) high-performance training so that non-trivial models can be trained in real-time (using roofline design and GPU hardware), (ii) a machine learning architecture which is modular, and supports primary and secondary loss functions, and (iii) highly-interactive visualization tools that support dynamic creation of visualizations and controls to match the bespoke criteria being optimized.
منابع مشابه
Application Experiences with the Globus
The development of applications and tools for high-performance \computational grids" is complicated by the heterogeneity and frequently dynamic behavior of the underlying resources; by the complexity of the applications themselves, which often combine aspects of supercomputing and distributed computing; and by the need to achieve high levels of performance. The Globus toolkit has been developed...
متن کاملApplication Experiences with the Globus Toolkit
The development of applications and tools for highperformance “computational grids” is complicated by the heterogeneity and frequently dynamic behavior of the underlying resources; by the complexity of the applications themselves, which often combine aspects of supercomputing and distributed computing; and by the need to achieve high levels of performance. The Globus toolkit has been developed ...
متن کاملDesign of an Application Development Toolkit for HPF / Fortran 90 D
The development of eecient application software capable of exploiting available High Performance Computing (HPC) systems is non-trivial and is largely governed by the availability of suuciently high-level languages, tools, and application development environments. In this paper we describe the design and operation of a toolkit for HPF/Fortran 90D application development. The toolkit incorporate...
متن کاملInteractive Form-Generation in High-Performance Architecture Theory
Architecture as a designerly way of thinking and knowing is to interact with its environment. The manuscript is to speculate “interactive form-generation” based on high-performance architecture theory, and discuss the precursors and the potentials. The research aims to explore and determine the roots, aspects of interactive architecture as a part of performance-based design in contemporary arch...
متن کاملFeasibility of using Medical Imaging Interaction Toolkit in volumetric studies to accurate diagnosing of vascular emboli by Extended NURBS-based Cardiac-Torso phantom
Introduction: Important complications of venous thromboembolism (VTE) are a longer hospital stay, readmission, recurrence of the emboli, complications of anticoagulant therapy and death in a sever condition. In present study, the volume measurement accuracy of the medical imaging interaction toolkit (MITK) software on determining VTE in computed tomography images was evaluated....
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2015